Construction of a Membrane Protein Database and an Evaluation of Several Prediction Methods of Transmembrane Segments

نویسنده

  • Toshio Shimizu
چکیده

How reliable and useful are predictions of transmembrane segments(TMSs) of membrane proteins from the amino acid sequences? It remains still under debate. Kyte and Doolittle proposed a simple scheme for the prediction of TMSs [1]. It is based on the hydropathy plot and is widely accepted as a basic and standard method. Since then, a large number of more sophisticated predictive algorithms have been proposed, which are improved varieties of the Kyte-Doolittle's approach. Although these methods have been considered to give rather good results, their abilities are still not enough to predict the number and positions of TMSs precisely; they often give totally di erent predictive results with proteins having many TMSs, in particular [2, 3]. One reason for this situation can be attributable to the low quality of the information on TMSs described in general amino acid sequence databases. The information included within the SWISS-PROT database, for example, is mostly not based on any experimental evidence but on predicted models; there is often no explicit description about whether the data comes from experiments or calculations in databases. Higher quality of information on TMSs from experimental evidence only is essential to evaluate existing prediction methods more precisely and to develop an algorithm overcoming their problems. We have collected 128 references reporting the membrane topology of proteins, and are continuing our e orts to triple this number. From them, we selected 54 topology models based on experimental evidence, at least partially. Combining these data with the sequence information from the SWISS-PROT database, we have constructed a membrane protein database in the form of relational database. Current version includes 54 proteins which are classi ed into 3 1 õÆ ød ]MâQ Q m 036 ]M=£3e 3 2 ?E û» Çõž 9$õ Öj Ç`÷•Qõ m 444 Çõ=gâdecýR? 38 prokaryote (29) eukaryote (25) non-helical (3) Figure 1: Content of Membrane Protein Database. Number of data in parentheses groups (eukaryotic proteins, prokaryotic proteins, and the proteins with non-helical segments) as shown in Figure 1. Using this database we evaluated the predictability of the algorithms of following authors: Eisenberg [4]; Klein, Kanehisa and DeLisi(KKD method) [5]; von Heijne(TopPred method) [6]; and Persson and Argos [7]. The KKD method and the TopPred method predicted the exact number of TMSs for 59% and 67% of proteins in our database, respectively. These values could be increased to 63% and 74% by optimizing respective parameter values. The KKD method tends to predict fewer number of TMSs than the correct number, while the TopPred method shows the opposite tendency. We are now testing our previous idea to use di erent cut-o parameters for one TMS proteins and multiple TMS proteins in the KKD method and are also trying to develop a new predictive algorithm, by taking more precise position-dependent information on TMS into account.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A novel tool for the prediction of transmembrane protein topology based on a statistical analysis of the SwissProt database: the OrienTM algorithm.

OrienTM is a computer software that utilizes an initial definition of transmembrane segments to predict the topology of transmembrane proteins from their sequence. It uses position-specific statistical information for amino acid residues which belong to putative non-transmembrane segments derived from statistical analysis of non-transmembrane regions of membrane proteins stored in the SwissProt...

متن کامل

A Simple Method for Predicting Transmembrane Proteins Based on Wavelet Transform

The increasing protein sequences from the genome project require theoretical methods to predict transmembrane helical segments (TMHs). So far, several prediction methods have been reported, but there are some deficiencies in prediction accuracy and adaptability in these methods. In this paper, a method based on discrete wavelet transform (DWT) has been developed to predict the number and locati...

متن کامل

Construction of an Expression Plasmid (Vector) Encoding Brucella melitensis Outer Membrane Protein, a Candidate for DNA Vaccine

Background: DNA vaccination with plasmid encoding bacterial, viral, and parasitic immunogens has been shown to be an attractive method to induce efficient immune responses. Bacteria of the genus Brucella are facultative intracellular pathogens for which new and efficient vaccines are needed. Methods: To evaluate the use of a DNA immunization strategy for protection against brucellosis, a pla...

متن کامل

On the Prediction of Transmembrane Helical Segments in Membrane Proteins Based on Wavelet Transform

The prediction of transmembrane helical segments (TMHs) in membrane proteins is an important field in the bioinformatics research. In this paper, a new method based on discrete wavelet transform (DWT) has been developed to predict the number and location of TMHs in membrane proteins. PDB coded as 1KQG was chosen as an example to describe the prediction of the number and location of TMHs in memb...

متن کامل

On the Accuracy of Transmembrane Segment Prediction of Helical Integral Membrane Proteins

Integral membrane proteins play a vital role in a number of essential biological functions. Although abundant, about 30% of genes are known to code for membrane proteins, the number of solved structures in the pdb is less than 1%. Thus, structure prediction of membrane proteins is an essential tool for understanding their functions. A fundamental characteristic of the predicted structure is the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994